How Should a Speech Recognizer Work?
نویسندگان
چکیده
Although researchers studying human speech recognition (HSR) and automatic speech recognition (ASR) share a common interest in how information processing systems (human or machine) recognize spoken language, there is little communication between the two disciplines. We suggest that this lack of communication follows largely from the fact that research in these related fields has focused on the mechanics of how speech can be recognized. In Marr's (1982) terms, emphasis has been on the algorithmic and implementational levels rather than on the computational level. In this article, we provide a computational-level analysis of the task of speech recognition, which reveals the close parallels between research concerned with HSR and ASR. We illustrate this relation by presenting a new computational model of human spoken-word recognition, built using techniques from the field of ASR that, in contrast to current existing models of HSR, recognizes words from real speech input.
منابع مشابه
Can a syntactic language model be incorporated into an HMM-based ASR framework?
Automatic speech recognition (ASR) is the process which a computer interprets human speech into some kind of meaningful representation. Usually, the task for the automatic speech recognition is to identify the correct wordsequence. Speech recognizer is used in different applications for example simple command-control programs, transcription and speech understanding. A good automatic speech reco...
متن کاملEliciting Natural Speech From Non-Native Users: Collecting Speech Data For LVCSR
In this paper, we discuss the design of a database of recorded and transcribed read and spontaneous speech of semiuent, strongly-accented non-native speakers of English. While many speech applications work best with a recognizer that expects native-like usage, others could bene t from a speech recognition component that is forgiving of the sorts of errors that are not a barrier to communication...
متن کاملImpact of different speaking modes on EMG-based speech recognition
We present our recent results on speech recognition by surface electromyography (EMG), which captures the electric potentials that are generated by the human articulatory muscles. This technique can be used to enable Silent Speech Interfaces, since EMG signals are generated even when people only articulate speech without producing any sound. Preliminary experiments have shown that the EMG signa...
متن کاملمدل میکروسکوپی دوگوشی مبتنی بر فیلتر بانک مدولاسیون برای پیش گویی قابلیت فهم گفتار در افراد دارای شنوایی عادی
In this study, a binaural microscopic model for the prediction of speech intelligibility based on the modulation filter bank is introduced. So far, the spectral criteria such as the STI and SII or other analytical methods have been used in the binaural models to determine the binaural intelligibility. In the proposed model, unlike all models of binaural intelligibility prediction, an automatic ...
متن کاملUnsupervised training of an HMM-based speech recognizer for topic classification
HMM-based Speech-To-Text (STT) systems are widely deployed not only for dictation tasks but also as the first processing stage of many automatic speech applications such as spoken topic classification. However, the necessity of transcribed data for training the HMMs precludes its use in domains where transcribed speech is difficult to come by because of the specific domain, channel or language....
متن کاملGender-dependent emotion recognition based on HMMs and SPHMMs
It is well known that emotion recognition performance is not ideal. The work of this research is devoted to improving emotion recognition performance by employing a two-stage recognizer that combines and integrates gender recognizer and emotion recognizer into one system. Hidden Markov Models (HMMs) and Suprasegmental Hidden Markov Models (SPHMMs) have been used as classifiers in the two-stage ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Cognitive science
دوره 29 6 شماره
صفحات -
تاریخ انتشار 2005